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The  Rationale  for  SDL 


String  Description  Language,  hereafter  referred  to  as  SDL,  is  a  language 
which  enables  one  to  describe  the  format  of  character  strings.  It  gives  one  the 
ability  to  precisely,  or,  if  desired,  imprecisely,  describe  a  generalized 
character  string.  For  example,  if  the  user  wishes  to  describe  all  character 
strings  which  consist  of  an  arbitrary  number  (from  1  to  n)  of  'A's  followed  by 
an  arbitrary  number  of  'B's,  he  may  describe  this  precisely  in  SDL  as  follows: 

<ABSTRING>  :  *AB'  !  ’A'*  <ABSTRING>  *B’» 

As  the  user  can  see,  this  is  a  rather  elegant  and  simple  description  of  such 
character  strings,  while  the  English  words  used  to  describe  such  strings  are 
somewhat  awkward. 

SDL  was  developed  by  the  author  as  a  result  of  his  lack  of  satisfaction 
with  other  string  description  languages  such  as  BNF  (Backus-Naur  Form)  and 
Extended  BNF.  SDL  is  a  combination  of  what  the  author  considers  to  be  the  good 
points  of  several  string  description  languages  and  the  features  he  finds  lacking 
in  them. 

Particularly,  the  features  carried  over  into  SDL  from  other  string 
description  languages  include  the  relatively  simple  form  of  the  Extended  BNF 
statements,  the  BNF  and  Extended  BNF  operators  (with  some  modification),  the 
form  of  constant  strings  employed  by  Extended  BNF,  and  recursion. 

Features  incorporated  into  SDL  which  are  not  found  in  BNF  and  Extended  BNF 
include  the  multiplicity  form  of  the  '*'  and  '+'  operators,  the  use  of 
SDL-defined  non-terminal  variables,  and  the  ability  to  imprecisely  define 
strings  using  an  English  description. 

SIX.,  as  described  in  this  document,  is  a  refinement  of  the  SDL  described  in 
the  author's  MS  thesis  "ARIAN  —  An  Implementation  of  a  Microcomputer  Operating 
System." 


SDL  Variables  and  Constants 


All  constants  in  SDL  are  string  constants,  which  take  the  form  of  zero  cr 
more  characters  of  the  ASCII  character  set  enclosed  in  single  quotes.  The 
double  single  quote  (")  is  used  to  represent  a  single  quote  oonstant.  Examples 
of  SDL  constants  are  'abc',  'hello',  '  'O',  'teB^ABC',  and  '•:bc1,.'. 

A  variable  in  SDL  is  a  symbol  which  may  be  assigned  a  string  value.  Two 
types  of  variables  are  used  in  SDL  —  terminal  and  non-terminal  variables.  A 
non-terminal  variable  is  a  string  of  alphanumeric  characters  enclosed  in  angle 
brackets  «»,  and  it  is  used  to  represent  certain  special  characters  (such  as 
carriage  return)  and  characters  or  strings  the  user  wishes  to  use  to  help  define 
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other  non-terminal  and  terminal  variables  in  his  SDL  equations.  A  terminal 
variable  is  a  string  of  alphanumeric  characters  (the  first  of  which  must  be 
alphabetic)  which  is  not  enclosed  in  angle  brackets.  Terminal  variables  are 
final  products;  they  may  not  be  used  to  help  define  other  variables. 
Non-terminal  variables  may  be  used  to  help  describe  themselves  in  their  SDL 
expression .  This  is  a  recursive  definition.  The  first  example  with  <ABSTRING> 
is  a  recursive  definition. 

The  underscore  (_)  can  be  used  in  variable  names  to  improve  readability;  if 
used,  its  location  in~the  variable  name  is  as  significant  as  the  location  of  any 
other  character  in  the  variable  name. 

SDL  defines  a  number  of  non-terminal  variables  which  may  be  used  in  SDL 
equations.  These  non-terminals  are: 


1.  <cr>  or  <CR>  —  the  ASCII  carriage  return 
character, 

2.  <bs>  or  <BS>  —  the  ASCII  backspace  character, 

3.  <lf>  or  <LF>  —  the  ASCII  line  feed  character, 

4.  <hull>  or  <NULL>  —  the  ASCII  null  character, 

5.  <erapty>  or  <EMPTT>  —  the  empty  or  null  string, 

6.  <bel>  or  <BEL>  —  the  ASCII  BEL  character  (ctrl-G) , 

7.  <ff>  or  <FF>  —  the  ASCII  form  feed  character, 

8.  <tab>  or  <TAB>  —  the  ASCII  tab  character. 


These  SDL-defined  non-terminal  variables  may  be  used  in  any  SDL  expression 
without  being  defined  elsewhere.  These  symbols  may  also  be  redefined  by  the 
user  at  his  discretion,  but  care  must  be  taken  not  to  think  of  their  original 
SDL  definitions  if  the  redefinition  of  the  variable  assigns  it  a  meaning 
different  from  its  SDL  meaning. 
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SDL  Operators 


SIX.  allows  the  user  to  employ  several  operators  in  the  SDL  expressions  he 
creates.  These  operators  are: 

Symbol  Name  of  Operator 

"  double  quote 
*  asterisk 
+  the  plus  symbol 

!  the  exclamation  mark 

?  the  question  mark 

...  the  mark  of  ellipsis 

(  the  left  parenthesis 

)  the  right  parenthesis 
:  colon 


The  uses  of  these  SDL  operators  are  as  follows: 

1.  double  quote  -  this  is  used  to  enclose  a 
description  phrase.  A  description  phrase  is  an  English 
phrase  used  to  describe  a  variable;  an  example  of  a 
description  phrase  is  "THE  DIGITS  0  AND  1". 

2.  asterisk  -  this  is  one  of  SDL's  two  multiplication 
symbols.  It  means  that  the  preceeding  symbol  or  expression 
may  be  repeated  zero  cr  more  times.  For  example,  <digit>* 
means  that  <diglt>  may  be  repeated  zero  cr  more  times.  If 
the  asterisk  is  followed  by  a  nunber,  the  nunber  specifies 
the  upper  limit  of  the  multiplication.  For  example, 
<digit>*4  says  that  there  may  be  zero  to  A  digits  in  the 
string. 

3.  the  plus  symbol  -  this  is  the  other  multiplication 
symbol.  It  is  used  exactly  like  the  asterisk,  but  it 
specifies  one  or  more.  The  number  following  it  may  also  be 
present,  indicating  an  upper  limit.  For  example,  <digit>+4 
says  that  there  may  be  from  1  to  4  <dlglt>s  in  the  string. 

4.  the  exclamation  mark  -  this  is  the  logical  OR 
operator  in  SDL.  Used  in  expressions,  it  means  that  the 
right-hand  side  or  the  left-hand  side  are  possible.  For 
example,  <alpha>  1  <digit>  says  that  whatever  is  being 
discussed  may  be  an  <alpha>  or  a  <dig.lt>. 

5.  question  mark  -  this  means  that  the  preceeding 
symbol  or  expression  is  optional.  For  example,  <alpha>? 
means  that  an  <alpha>  is  optional. 
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6.  the  aarks  of  ellipsis  -  implies  that  the  symbols  or 
characters  are  continuous  by  ASCII  order.  For  example , 

'A*  !  ...  1  *F*  refers  to  ’A',  'B',  *C\  ’D* ,  *E\  *F’. 

7.  parentheses  -  used  as  grouping  symbols.  They  may 
be  nested  as  deeply  as  desired. 

8.  colon  -  this  is  the  assignment  symbol.  It  must  be 
present  in  every  SDL  statement,  and  it  assigns  the 
ocpressicn  on  its  right  to  the  variable  on  its  left. 


SDL  Statements 


An  SDL  description  of  a  string  or  set  of  strings  consists  of  a  group  of  SDL 
statements.  An  SDL  statement  consists  simply  of  a  variable,  which  may  be  used 
to  represent  the  string  in  question,  followed  by  a  colon  and  an  SDL  expression. 
An  SDL  expression  is  a  grouping  of  variables,  operators,  description  phrases, 
and  ellipses  which  describes  a  string.  In  this  grouping  each  variable  is 
separated  by  one  or  more  delimiters,  where  a  delimiter  is  an  operator,  a  space, 
or  a  new  line. 


Examples 


The  following  is  a  group  of  examples  of  SDL  statements: 


1.  <digit>  :  ’O’  !  ...  !  *9* 

This  statement  defines  <digit>  to  be  any  of  the 
characters  *0’  to  *9’. 

2.  <hexdlgit>  :  <digit>  !  »A*  !  ...  !  'F' 

This  statement  defines  <hexdlglt>  to  be  a  <dlgit>  or 
one  of  the  characters  *A'  to  *F'. 

3.  HEX_ADDRESS  :  <hexdigit>»4 

HEX  ADDRESS  is  defined  as  a  string  of  from  0  to  4 
<hexdigl?>s.  For  example,  HEX  ADDRESS  may  be  "",  "ABC”. 
*0123",  "00". 

4.  NUM3ER  :  <digit>+4 

NUffiER  is  defined  as  a  string  of  from  1  to  4  <dlglt>s. 
Mote  that  NUMBER  and  HEX_ADDRESS  are  terminal  variables  and 
may  not  be  used  in  expressions  other  than  those  like  ”  VALUE 
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:  TERMINAL 

5.  ITIS  :  <alpha>+  ».•? 

ITZS  is  defined  as  a  string  of  one  or  more  <alpha>s 
followed  optionally  by  a  period. 

6.  ENGLISHJSENTENCE  :  "ANY  COMBINATION  OF  ASCII  CHARACTERS" 

ENGLISHJSENTENCE  is  defined  by  a  description  phrase. 

7.  THING  :  <digitN4  !  <hexdigit>+4 

THING  is  defined  as  being  either  from  1  to  4  <digit>s 
or  from  1  to  4  <hexdigit>s. 

8.  <neat>  :  *A'  !  *B'  !  'A»  <neat>  'B* 

<neat>  is  recursively  defined.  Strings  like  ’A',  'B* , 

*AAB* ,  • AAAABBB’ ,  and  1 ABBBEE1  are  possible. 

The  user  will  note  that  spaces  are  never  implied  in  SDL.  They  must 
placed  in  the  string  explicitly. 
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Sunnary 


SDL  is  a  recursive  String  Description  Language  which  is  used  to  define 
strings  of  ASCII  characters.  It  penults  implicit  and  explicit  definition  of 
strings,  and  it  can  be  generally  used  to  describe  almost  any  language  or  set  of 
strings.  Derived  from  BNF  and  Extended  BNF,  it  contains  many  features  from 
these  languages  as  well  as  several  extensions  made  by  the  author. 

Some  features  of  SDL  are: 


1.  SDL-de fined  non-terminal  variables  which  need  not 
be  defined  by  the  user. 

2.  The  use  of  ellipsis  to  indicate  items  left  out  of  a 
definition. 

3.  Multiplication  of  symbols  or  expressions  with  *  or 
+  may  be  followed  by  a  number  to  indicate  the  limit  of 
multiplication. 

4.  Recursive  definition  which  allows  a  variable  to  be 
defined  in  terms  of  itself. 

5.  The  use  of  a  description  phrase  in  a  variable 
definition. 


Like  BNF  and  Extended  BNF,  SDL  may  be  used  to  describe  context  free 
languages.  It  was  not  intended  to  be  used  as  such,  and  SDL  lacks  some  of  the 
formality  of  BNF  and  Extended  BNF,  but  the  value  of  this  application  is  present. 
SDL  was  designed  primarily  to  be  used  as  its  name  implies  —  to  describe 
character  strings,  and,  as  the  user  may  note,  a  computer  language  can  be 
referred  to  in  terms  of  character  strings. 
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